Don't type check most function bodies if ignoring errors #14150

JukkaL · 2022-11-19T14:17:51Z

If errors are ignored, type checking function bodies often can have no effect. Remove function bodies after parsing to speed up type checking.

Methods that define attributes have an externally visible effect even if errors are ignored. The body of any method that assigns to any attribute is preserved to deal with this (even if it doesn't actually define a new attribute). Most methods don't assign to an attribute, so stripping bodies is still effective for methods.

There are a couple of additional interesting things in the implementation:

We need to know whether an abstract method has a trivial body (e.g. just ...) to check super() method calls. The approach here is to preserve such trivial bodies and treat them differently from no body at all.
Stubgen analyzes the bodies of functions to e.g. infer some return types. As a workaround, explicitly preserve full ASTs when using stubgen.

The main benefit is faster type checking when using installed packages with inline type information (PEP 561). Errors are ignored in this case, and it's common to have a large number of third-party code to type check. For example, a self check (code under mypy/) is now about 20% faster, with a compiled mypy on Python 3.11.

Another, more subtle benefit is improved reliability. A third-party library may have some code that triggers a mypy crash or an invalid blocking error. If bodies are stripped, often the error will no longer be triggered, since the amount code to type check is much lower.

ilevkivskyi

Nice! I like the idea, left couple comments (not a full review).

ilevkivskyi · 2022-11-19T15:02:33Z

mypy/fastparse.py

+        self.lvalue = False
+        self.found = False
+
+    def visit_assignment_stmt(self, s: AssignmentStmt) -> None:


Do you also need to check assignment expression? (i.e. walrus a.k.a :=)

It doesn't support assigning to an attribute.

ilevkivskyi · 2022-11-19T15:04:41Z

mypy/fastparse.py

+        ):
+            # We only strip method bodies if they don't assign to an attribute, as
+            # this may define an attribute which has an externally visible effect.
+            visitor = FindAttributeAssign()


Would it be possible to strip statements after last assignment to attribute? This could make a bit more perf gain.

This may be possible! I'd rather not do it in this PR to keep it simple (and the impact is probably pretty minor), but it's seems like a promising follow-up improvement to investigate.

cdce8p

Got a chance to test the PR on a full run with Home Assistant (with all dependencies installed). The performance improvement is noticeable. Both with the uncompiled version on Python 3.9 - perviously: ~11min - now: ~7:30min 🚀

Unfortunately, I also got a few new error messages which the primer didn't pick up on.

homeassistant/components/nibe_heatpump/__init__.py:267: error: "Coroutine[Any, Any, AsyncIterator[Coil]]" has no attribute "__aiter__" (not async iterable)  [attr-defined]
homeassistant/components/nibe_heatpump/__init__.py:267: note: Maybe you forgot to use "await"?
homeassistant/components/google/calendar.py:350: error: "Coroutine[Any, Any, AsyncIterator[ListEventsResponse]]" has no attribute "__anext__"  [attr-defined]
homeassistant/components/google/calendar.py:350: note: Maybe you forgot to use "await"?
homeassistant/components/homekit_controller/config_flow.py:155: error: "Coroutine[Any, Any, AsyncIterable[AbstractDiscovery]]" has no attribute "__aiter__" (not async iterable)  [attr-defined]
homeassistant/components/homekit_controller/config_flow.py:155: note: Maybe you forgot to use "await"?
homeassistant/components/amcrest/__init__.py:209: error: Return type "_AsyncGeneratorContextManager[Response]" of "async_stream_command" incompatible with return type "_AsyncGeneratorContextManager[<nothing>]" in supertype "Http"  [override]
homeassistant/components/amcrest/__init__.py:214: error: Need type annotation for "ret"  [var-annotated]
Found 5 errors in 4 files (checked 5433 source files)

One of the edge cases seem to be async iterators with yield. To reproduce it, create two files

# a.py
from typing import AsyncIterator


class L:
    async def some_func(self, i: int) -> str:
        return str(i)

    async def get_iterator(self) -> AsyncIterator[str]:
        for i in range(5):
            yield await self.some_func(i)

# b.py
from a import L

async def func(l: L) -> None:
    reveal_type(l.get_iterator)
    async for i in l.get_iterator():
        print(i)

And run mypy b.py.

b.py:5: note: Revealed type is "def () -> typing.Coroutine[Any, Any, typing.AsyncIterator[builtins.str]]"
b.py:6: error: "Coroutine[Any, Any, AsyncIterator[str]]" has no attribute "__aiter__" (not async iterable)  [attr-defined]
b.py:6: note: Maybe you forgot to use "await"?

cdce8p · 2022-11-20T16:24:45Z

The second edge case is fairly similar, just with the addition of @asynccontextmanager.

# a.py
from contextlib import asynccontextmanager
from typing import AsyncIterator

class Parent:
    @asynccontextmanager
    async def async_func(self) -> AsyncIterator[str]:
        yield ''

# b.py
from contextlib import asynccontextmanager
from typing import AsyncIterator

from test8 import Parent

class Child(Parent):
    @asynccontextmanager
    async def async_func(self) -> AsyncIterator[str]:
        yield ''

Running mypy b.py

b.py:9: error: Return type "_AsyncGeneratorContextManager[str]" of "async_func" incompatible with return type "_AsyncGeneratorContextManager[<nothing>]" in supertype "Parent"  [override]

JukkaL · 2022-11-21T16:39:08Z

@cdce8p Thanks for the detailed reports about regressions! I clearly need to investigate those cases.

These can have an externally visible effect within an async function. The existence of yield affects the inferred return type, and a yield from generates a blocking error.

cdce8p · 2022-12-04T13:39:20Z

@cdce8p Thanks for the detailed reports about regressions! I clearly need to investigate those cases.

Just tested the PR with Home Assistant again. The last changes resolve all issues 🎉

JukkaL · 2022-12-08T17:03:31Z

There are some performance regressions on master that I'd like to address first before merging this. Otherwise the performance impact estimate might be inflated.

hauntsaninja · 2023-01-25T20:35:06Z

@JukkaL I just fixed conflicts. What's the status of this PR? Are you ready to re-measure perf / merge in time for 1.0?

christianbundy · 2023-02-06T17:50:31Z

Just wanted to quickly note that I think this will be very valuable for codebases that are migrating from untyped Python to typed Python. We use # mypy: ignore-errors in most test files, and excluding those files made our type-checking 300% as fast (i.e. our type-check time was reduced 66%, from 6 minutes to 2 minutes). I didn't test this PR, but I'm using this script which may have similar behavior.

# Exclude files with '# mypy: ignore-errors' for performance
# - Requires GNU Grep (tested with 3.8)
# - Ignores the same directories as Mypy https://mypy.readthedocs.io/en/stable/command_line.html#cmdoption-mypy-exclude
# - Also ignores hidden directories like `.git` and `.mypy_cache`
# - Does not support unnecessary whitespace in comment
exclude="$(\
    grep \
    --binary-files without-match \
    --recursive \
    --include '*.py'  \
    --include '*.pyi' \
    --exclude-dir 'site-packages' \
    --exclude-dir 'node_modules' \
    --exclude-dir '__pycache__' \
    --exclude-dir '.*' \
    --files-with-matches \
    '^# mypy: ignore[-_]errors$' \
    | sed 's/^\.\///g' \
    | tr '\n' '|' \
    | sed 's/\./\\\./g' \
    | sed 's/|$//'\
)"

mypy --exclude "$exclude"

github-actions · 2023-04-24T15:13:46Z

According to mypy_primer, this change has no effect on the checked open source code. 🤖🎉

JukkaL · 2023-04-24T15:39:30Z

I finally got around to running some benchmarks. Self-checking mypy (not including mypyc) was about 10% faster when compiled, and 15% faster when interpreted.

JukkaL added 21 commits November 11, 2022 13:24

WIP

200bd45

more WIP

5191aa2

WIP fix ignoring errors

a19c4da

Tweak docstring

75077e8

Add some parser tests

88a1a58

Handle more kinds of bodies

9a1755f

Improve test case

83abaf8

Add tests

f838466

Clean up tests

13d68cf

WIP add failing test case

a64e92e

Preserve trivial bodies and nested blocks

20b4582

Remove debug print

89ddfd7

Update test case

45737b5

Cleanup

5384716

Black

ad63379

isort

24519fe

Fix tests

79ab207

Minor test update

0a8e69d

Add docstring

3b8fc66

Fix stubgen tests

683237f

Minor tweaks

aa31f2c

This comment has been minimized.

Sign in to view

ilevkivskyi reviewed Nov 19, 2022

View reviewed changes

cdce8p reviewed Nov 20, 2022

View reviewed changes

JukkaL added 3 commits December 4, 2022 11:26

Merge branch 'master' into strip-bodies

bcc7cd6

Fix handling of yield and yield from

bc23a92

These can have an externally visible effect within an async function. The existence of yield affects the inferred return type, and a yield from generates a blocking error.

Fix type check

a94039d

This comment has been minimized.

Sign in to view

jhance approved these changes Dec 5, 2022

View reviewed changes

AlexWaygood added the performance label Dec 11, 2022

AlexWaygood mentioned this pull request Jan 2, 2023

Use --additional-flags='check-untyped-defs' when running mypy_primer python/typeshed#9433

Merged

Merge branch 'master' into strip-bodies

bec85a1

This comment has been minimized.

Sign in to view

hauntsaninja mentioned this pull request Jan 26, 2023

Release 1.0 planning #13685

Closed

17 tasks

Merge branch 'master' into strip-bodies

2e1ac69

JukkaL merged commit aee983e into master Apr 24, 2023

JukkaL deleted the strip-bodies branch April 24, 2023 15:39

JukkaL mentioned this pull request Apr 24, 2023

mypy 0.990 is 1000x slow on pydantic codebase vs 0.982 #14034

Closed

hauntsaninja mentioned this pull request Jun 21, 2023

[1.4] Regression with AsyncGenerator #15489

Closed

ajeklund mentioned this pull request Jul 7, 2023

[Auto-generated] Update dependencies EMMC-ASBL/oteapi-dlite#149

Merged

2 tasks

hauntsaninja mentioned this pull request Aug 13, 2023

Avoid type checking dependencies when errors aren't reported #12854

Open

k4nar mentioned this pull request Aug 28, 2023

Mypy: @declared_attr crash mypy when using "--follow-imports=silent" sqlalchemy/sqlalchemy#10282

Open

2 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Don't type check most function bodies if ignoring errors #14150

Don't type check most function bodies if ignoring errors #14150

JukkaL commented Nov 19, 2022

This comment has been minimized.

ilevkivskyi left a comment

ilevkivskyi Nov 19, 2022

JukkaL Nov 19, 2022

ilevkivskyi Nov 19, 2022

JukkaL Nov 19, 2022

cdce8p left a comment •

edited

Loading

cdce8p commented Nov 20, 2022

JukkaL commented Nov 21, 2022

This comment has been minimized.

cdce8p commented Dec 4, 2022

JukkaL commented Dec 8, 2022

hauntsaninja commented Jan 25, 2023

This comment has been minimized.

christianbundy commented Feb 6, 2023 •

edited

Loading

github-actions bot commented Apr 24, 2023

JukkaL commented Apr 24, 2023

Don't type check most function bodies if ignoring errors #14150

Don't type check most function bodies if ignoring errors #14150

Conversation

JukkaL commented Nov 19, 2022

This comment has been minimized.

ilevkivskyi left a comment

Choose a reason for hiding this comment

ilevkivskyi Nov 19, 2022

Choose a reason for hiding this comment

JukkaL Nov 19, 2022

Choose a reason for hiding this comment

ilevkivskyi Nov 19, 2022

Choose a reason for hiding this comment

JukkaL Nov 19, 2022

Choose a reason for hiding this comment

cdce8p left a comment • edited Loading

Choose a reason for hiding this comment

cdce8p commented Nov 20, 2022

JukkaL commented Nov 21, 2022

This comment has been minimized.

cdce8p commented Dec 4, 2022

JukkaL commented Dec 8, 2022

hauntsaninja commented Jan 25, 2023

This comment has been minimized.

christianbundy commented Feb 6, 2023 • edited Loading

github-actions bot commented Apr 24, 2023

JukkaL commented Apr 24, 2023

cdce8p left a comment •

edited

Loading

christianbundy commented Feb 6, 2023 •

edited

Loading